Partitioning Programs for Automatically Exploiting GPU∗

نویسندگان

Eric Petit

Sebastien Matz

Francois Bodin

چکیده

In this paper we explore the use of Astex, a C language partitioning tool, to detect parts of code that can potentially be speeded up by using Graphical Processing Units (GPU).

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

OpenCL Task Partitioning in the Presence of GPU Contention

Heterogeneous multiand many-core systems are increasingly prevalent in the desktop and mobile domains. On these systems it is common for programs to compete with co-running programs for resources. While multi-task scheduling for CPUs is a well-studied area, how to partitioning and map computing tasks onto the hetergeneous system in the presence of GPU contention (i.e. multiple programs compete ...

متن کامل

Improving Inter-thread Data Sharing with GPU Caches

The massive amount of fine-grained parallelism exposed by a GPU program makes it difficult to exploit shared cache benefits even there is good program locality. The non deterministic feature of thread execution in the bulk synchronize parallel (BSP) model makes the situation even worse. Most prior work in exploiting GPU cache sharing focuses on regular applications that have linear memory acces...

متن کامل

Data Partitioning Strategy of GPU Heterogeneous Clusters Based on Learning

With the rapid progress of computational science and computer simulation ability, a lot of properties can be predicted by the powerful ability of parallel computation before the actual research and development. With the development of high performance computer architecture, GPU is more and more widely used in high performance computation field as an emerging architecture, and a growing number o...

متن کامل

StarPU-MPI: Task Programming over Clusters of Machines Enhanced with Accelerators

GPUs clusters are becoming widespread HPC platforms. Exploiting them is however challenging, as this requires two separate paradigms (MPI and CUDA or OpenCL) and careful load balancing due to node heterogeneity. Current paradigms usually either limit themselves to offload part of the computation and leave CPUs idle, or require static CPU/GPU work partitioning. We thus have previously proposed S...

متن کامل

Work stealing for GPU-accelerated parallel programs in a global address space framework

Task parallelism is an attractive approach to automatically load balance the computation in a parallel system and adapt to dynamism exhibited by parallel systems. Exploiting task parallelism through work stealing has been extensively studied in shared and distributed-memory contexts. In this paper, we study the design of a system that uses work stealing for dynamic load balancing of task-parall...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2006

Partitioning Programs for Automatically Exploiting GPU∗

نویسندگان

چکیده

منابع مشابه

OpenCL Task Partitioning in the Presence of GPU Contention

Improving Inter-thread Data Sharing with GPU Caches

Data Partitioning Strategy of GPU Heterogeneous Clusters Based on Learning

StarPU-MPI: Task Programming over Clusters of Machines Enhanced with Accelerators

Work stealing for GPU-accelerated parallel programs in a global address space framework

عنوان ژورنال:

اشتراک گذاری